IN THE CLAIMS 



1. (Currently amended) An electronic device, comprising: 

a processor for executing an operating system program and a media content presentation 
program; 

a media content pickup device operatively connected to said processor, said media 
content pickup device arranged to capture media content input, and said media content pickup 
device arranged to automatically focus on a user- specified region of interest of the media 
content input without moving the media content pickup device; and 

an auto-focus mechanism of said media content pickup device arranged to 
automatically focus on said user-specified region of interest in response to a focus 
command; and 

a media output path to receive and to carry the focused media content input . 

2. (Previously presented) An electronic device as recited in claim 1, wherein the 
user- specified region of interest is specified by a user through interaction with a graphical user 
interface. 

3. (Previously presented) An electronic device as recited in claim 2, wherein the 
graphical user interface is provided by the media content presentation program that is executed 
by said processor. 

4. (Previously presented) An electronic device as recited in claim 2, wherein said 
media output path carries the focused media content input to be provided to a media output 
device, the media output device being part of said electronic device or separate from said 
electronic device. 

5. (original) An electronic device as recited in claim 4, 
wherein said media output device is a monitor, 
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wherein the graphical user interface is displayed on said monitor, and 

wherein the graphical user interface includes at least a media content display window. 

6. (Previously presented) An electronic device as recited in claim 5, wherein the 
user-specified region of interest is specified by the user with reference to the media content 
display window. 

7. (original) An electronic device as recited in claim 4, wherein said media output 
device is a monitor. 

8. (original) An electronic device as recited in claim 4, wherein said media output 
device is at least one speaker. 

9. (original) An electronic device as recited in claim 1, wherein the media content input 
is at least one of audio content or video content. 

10. (original) An electronic device as recited in claim 1, wherein said media content 
pickup device is at least one of a camera and a plurality of microphones. 

11. (original) An electronic device as recited in claim 1, wherein said electronic device 
is one of a mobile telephone, a personal computer, a personal digital assistant, and a handheld 
computer. 
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12. (Currently amended) A computer system, comprising: 
a processor for executing a video application program; 

a camera operatively connected to said processor, said camera arranged to capture video 
input in accordance with its field of view, and said camera arranged to automatically focus on a 
determined region of the field of view without moving the camera, the determined region being 
determined in accordance with a user input; and 

an auto-focus mechanism of said camera arranged to automatically focus on said 
determined region of the field of view in response to a focus command; and 

a data output means operatively connected to said processor, said data output means 
operates operating to provide the focused video input for display. 

13. (original) A computer system as recited in claim 12, wherein said processor receives 
a user input that indicates the determined region of the field of view. 

14. (original) A computer system as recited in claim 13, wherein the user input is with 
respect to a window displayed on said display. 

15. (original) A computer system as recited in claim 14, wherein the user input is a user 
selection of a region of the window. 

16. (Currently amended) A computer system as recited in claim 12 , wherein said computer 
further comprises further comprising : 

at least one microphone for sound pickup. 

17. (original) A computer system as recited in claim 16, wherein the video application 
program is an audio- video application, and wherein said processor receives the sound pickup 
from said at least one microphone and supplies audio output to a speaker. 
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18. (Currently amended) A computer system as recited in claim 17, wherein the speaker is 
coupled to and associated with said computer system . 



19. (Currently amended) A computer system as recited in claim 12 , wherein said computer 
further comprises further comprising : 

a plurality of microphones for sound pickup, said microphones having a known positional 
relationship to one another, 

wherein said microphones are integral with said camera. 

20. (original) A computer system as recited in claim 19, wherein said processor receives 
audio input from each of said microphones and processes the audio input to emphasize audio 
sound from the determined region that has been determined in accordance with the user input. 
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21. (currently amended) A method for altering a focus location for a camera coupled to a 
computing apparatus, said method comprising: 

receiving video input from the camera; 

displaying the video input in a video viewing window of a monitor; 

receiving an identification of a focus region that has been specified by a user by selecting 
an area of the video viewing window; and 

sending a focus command to an auto-focus mechanism of said camera; and 

causing the camera to focus on the focus region without moving the camera. 

22. (Canceled) 

23. (Canceled) 

24. (Previously presented) A method as recited in claim 21, wherein the user moves a 
curser image over the video viewing window using a pointing device to an area of interest, and 
then selects the focus region by clicking on the area of interest. 

25. (original) A method as recited in claim 24, wherein the user performs a button press 
to select the focus region. 

26. (original) A method as recited in claim 25, wherein the button press is with respect 
to a pointing device. 

27. (original) A method as recited in claim 26, wherein the pointing device is a mouse, 
trackball or a trackpad. 
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28. (Previously presented) A method as recited in claim 21, wherein the user moves a 
position reference image over the video viewing window using a pointing device to an area of 
interest, and then selects the focus region by clicking on the area of interest. 

29. (original) A method as recited in claim 21, wherein the focus region is an area of 
interest specified by the user. 

30. (Currently amended) A method as recited in claim 21, wherein said receiving of the 
audio video input is supplied from a first computing apparatus to a second computing apparatus, 
and said displaying of the video input and said receiving of the focus region are performed on the 
second computing apparatus. 

31. (original) A method as recited in claim 21, wherein the computing apparatus is one 
of a mobile telephone, a personal computer, a personal digital assistant, and a handheld 
computer. 
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32. (Currently amended) A method for using a computing apparatus to process audio input 
provided by a plurality of microphones, said method comprising: 

receiving audio input from the plurality of microphones; 

receiving an indication of a region of interest from a user with respect to a graphical user 
interface window being displayed on a monitor available to the user; and 

using digital signal processing to process the audio input to target the audio input 
towards the region of interest without moving said microphones . 

33. (original) A method as recited in claim 32, wherein said method further comprises: 
outputting the processed audio input to at least one speaker. 

34. (original) A method as recited in claim 33, wherein said method further comprises: 

repeating the foregoing operations after said outputting has output the processed audio 
input to the at least one speaker. 

35. (original) A method as recited in claim 32, wherein said processing captures audio 
from the region of interest while attempting to reject audio from other regions. 

36. (original) A method as recited in claim 32, wherein said processing utilizes beam 
forming and beam steering operations. 

37. (Currently amended) A method as recited in claim 32, wherein a camera couples to the 
computer computing apparatus , and 

wherein the camera has a housing and the microphones are internal to the housing of the 
camera. 

38. (original) A method as recited in claim 32, wherein the user performs a button press 
to select the region of interest. 
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39. (original) A method as recited in claim 38, wherein the button press is with respect 
to a pointing device. 

40. (original) A method as recited in claim 39, wherein the pointing device is a mouse, 
trackball or a trackpad. 

41. (original) A method as recited in claim 32, wherein the user moves a position 
reference image over the graphical user interface window using a pointing device to an area of 
interest, and then selects the region of interest by clicking on the area of interest. 

42. (original) A method as recited in claim 32, wherein said receiving of the audio input 
is supplied from a first computing apparatus to a second computing apparatus, and said 
displaying of the graphical user interface window and said receiving of the indication of the 
region of interest are performed on the second computing apparatus. 

43. (original) A method as recited in claim 32, wherein the computing apparatus is one 
of a mobile telephone, a personal computer, a personal digital assistant, and a handheld 
computer. 



9 



44. (Currently amended) A video conferencing system operable over a network, said video 
conferencing system comprising: 

a first computer system including at least a first processor for executing a first operating 
system program and a first video application program, a first camera to capture first video input, 
and a first monitor; and 

a second computer system operatively connectable to said first computer system via the 
network, said second computer system including at least a second processor for executing a 
second operating system program and a second video application program, a second camera to 
capture video input, and a second monitor; 

an auto-focus mechanism of said second camera arranged to automatically focus on 
a selected region of interest; 

wherein when said first computer system and said second computer system are involved 
in a video conference, said first monitor displays the second video input provided by said second 
camera via the network, and said second monitor displays the first video input provided by said 
first camera via the network, and 

wherein when a first user interacts with a window including the second video input 
presented on said first monitor to select a said region of interest with respect to the second video 
input, said second camera then automatically focuses itself using said auto-focus mechanism so 
that the second video input is focused on the region of interest without moving said second 
camera. 

45. (Canceled) 

46. (Previously presented) A video conferencing system as recited in claim 44, 
wherein the first user interfaces with the first graphical user interface by moving a graphical 
indicator over the window to identify the region of interest and then indicating its selection. 

47. (original) A video conferencing system as recited in claim 44, 

wherein said first computer system further includes at least a first plurality of 
microphones and a first speaker, 
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wherein said second computer system further includes at least a second plurality of 
microphones and a second speaker, 

wherein second audio input obtained by said second plurality of microphones is provided 
to said first computer system via the network and then output to said first speaker, 

wherein first audio input obtained by said first plurality of microphones is provided to 
said second computer system via the network and then output to said second speaker, and 

wherein said second multimedia computer system performs processing on the second 
audio input based on the region of interest selected by the first user, whereby the second audio 
input is processed so as to emphasize audio sound from the region of interest. 

48. (original) A video conferencing system as recited in claim 44, 

wherein said first plurality of microphones are internal to a housing of said first camera, 

and 

wherein said second plurality of microphones are internal to a housing of said second 
camera. 
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49. (Currently amended) A computer readable medium including at least computer 
program code for directing media content input, said computer readable medium comprising: 

computer program code for receiving media content input from a media content capturing 

device; 

computer program code for receiving a user- specified region of interest for the media 
content input; 

computer program code for processing a media content input into a media content display 
window based on a said user- specified region of interest without moving said media content 
capturing device, wherein the user- specified region of interest is specified by the user by 
selecting a region within the media content display window; and 

computer program code for providing the media content display window to a graphical 
user interface of a monitor. 

50. (Canceled) 

51. (Canceled) 

52. (Canceled) 

53. (original) A computer readable medium as recited in claim 49, wherein the media 
content input is at least one of audio content or video content. 

54. (original) A computer readable medium as recited in claim 49, wherein said media 
content pickup device is at least one of a camera and a plurality of microphones. 

55. (Previously presented) A computer readable medium as recited in claim 49, 
wherein the monitor includes at least one speaker. 
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Please add the following new claims: 

56. (New) An electronic device as recited in claim 1 wherein said auto-focus 
mechanism is arranged to automatically focus using position coordinates that identify said user- 
specified region of interest. 

57. (New) A computer system as recited in claim 12 wherein said auto-focus 
mechanism is arranged to automatically focus using position coordinates that identify said 
determined region of the field of view. 

58. (New) A method as recited in claim 21 further comprising: 

sending position coordinates identifying said focus region to said auto-focus mechanism. 

59. (New) A method as recited in claim 32 further comprising: 

using position coordinates identifying said region of interest to target the audio input 
towards the region of interest. 

60. (New) A video conferencing system as recited in claim 44 wherein said auto- 
focus mechanism is arranged to automatically focus using position coordinates that identify said 
selected region of interest. 

61 . (New) A computer readable medium as recited in claim 49 further comprising: 

computer program code for using position coordinates identifying said user-specified 
region of interest to process said media content input into a media content display window. 
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